AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Multi-GPU efficient deployment

# Multi-GPU efficient deployment

Deepseek R1 AWQ
MIT
AWQ quantized version of DeepSeek R1 model, optimized for float16 overflow issues and supports efficient inference deployment
Large Language Model Transformers Supports Multiple Languages
D
cognitivecomputations
30.46k
77
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase